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ABSTRACT: 

Suppose a sequence of random variables {X n } has negative drift when above a certain 
threshold and has increments bounded in LP. When p > 2 this implies that EX n is bounded 
above by a constant independent of n and the particular sequence {AT n }. When p < 2 there 
are counterexamples showing this does not hold. In general, increments bounded in LP lead 
to a uniform L r bound on X + for any r < p — 1, but not for r > p — 1. These results are 
motivated by questions about stability of queueing networks. 
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1. Introduction. 



Let Xq,X\, ... be a sequence of real- valued random variables. We wish to find a condition, 
along the lines of behaving like a supermartingale when sufficiently large, that will guarantee 
sup n EX n < oo. In particular, we do not wish to assume any special properties of the 
increments such as independence, r-dependence, Markov property, symmetry, discreteness 
or nondiscreteness. Under what conditions can we guarantee that sup„ E(A n ) < oo? 

Specifically, we suppose that for some a > and some J we have 

E(X n+1 - X n j X , . . . , X n ) < -a on the event {X n > J} (CI) 

for all n. That is, the process has negative drift whenever it is above the point J. This 
condition alone says nothing about possible large jumps out of the interval (— oo, J), so we 
also assume that for some p > 1 and some V < oo we have 

E(\X n+1 -X n \P\X ,...,X n )<V (C2) 

for all n. That is, the process has increments with bounded p th moments. Conditions (CI) 
and (C2) are meant to characterize the behavior of sequences attracted to a basin, which 
always decreases in expectation except when it is already small. (One such example is a 
nonnegative Lyapunov function of a Markov chain; see for example Meyn and Tweedie, 
1993.) Do these two conditions together imply that sup„E(A n ) < oo? 

When p = 1 the answer is no. Although an honest supermartingale has EX n < EAo for 
all n, a process behaving like a supermartingale above J may have EA n unbounded. More 
surprisingly, the answer is still no for p = 2. However, if one assumes (CI) and (C2) with 
p > 2, then necessarily sup n EX„ < oo. More generally, we show (Theorem 1) that (CI) 
and (C2) imply that sup n E((X+) r ) < oo whenever r < p — 1, and that this bound on r 
in terms of p (or p in terms of r) is sharp. Furthermore, our bounds are may be explicitly 
computed and depend only on the parameters a, J, p, V, and r. 

Our results are motivated by questions about queueing networks. Specifically, several 
authors (Borodin et al., 1996, 1998; Andrews et al., 1996) consider network loads under 
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the influence of a stochastic adversary. Here X n is the load of the network at time n. 
The adversary may add new packets to the network in virtually any manner, subject only 
to a load condition which leads to (CI) plus a moment condition such as (C2). (The load 
condition corresponds to the statement that, once the network is operating at full capacity, it 
processes packets more quickly on average than the adversary can add them.) The network 
is considered to be stable if the expected load remains bounded, i.e. if sup ra E(X„) < oo. In 
this context, our Corollary 2 may be interpreted as saying that a queueing network in the 
presence of a stochastic adversary is guaranteed to be stable, provided it satisfies the load 
condition (CI), and also the moment condition (C2) for some p > 2. On the other hand, if 
p < 2 then there is no such guarantee. 

We note that there has been some previous work on related questions. For example, Has- 
tad et al. (1996) consider bounds on sup n E(X„) for certain time-homogeneous Markovian 
systems which correspond to particular "backoff protocols" for resolving ethernet conflicts. 
Close to our work, Hajek (1982) investigates bounds on hitting times for general random 
sequences having bounded exponential moments, and derives corresponding bounds on ex- 
ponential moments of the hitting times; his work may thus be seen, roughly, as the p — > oo 
limit of our bounds. 

Finally, we note that while the notion of "stability" considered here (namely, that 
sup n E(X n ) < oo) is different from that of Markov chain stability (see e.g. Meyn and 
Tweedie, 1993), there are some connections. For example, it is known (see Tweedie, 1983, 
Theorem 2) that for k G N, if {X n } is an aperiodic Harris-recurrent time- homogeneous 
Markov chain having stationary distribution tt(-), and if = / x k ir(dx) < oo, then for 
7r-a.e. x, E<s x (Xn) — > m, and hence sup n E ( 5 x (X^) < oo. In other words, for such a Markov 
chain, stability in our sense is implied by standard Markov chain stability. In fact, it is 
known (e.g. Tuominen and Tweedie, 1994) that when {X n } is a random walk with negative 
drift, reset to zero when it attempts to leave the nonnegative half- line and having square 
integrable increments, then ~Es x (X n ) will converge and hence be bounded. This shows that 
our (CI) and (C2) do represent a greater generality than the random walk context. 
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The paper is organized as follows. In Section 2 we state the main result, along with two 
extensions. (The extensions are reasonably straightforward, but we include them in order 
to provide readily referenceable results that don't assume more than is needed.) We also 
provide in Section 2 a simpler proof of the main theorem in the case where p > 4 and r = 1, 
since in this case the back-of-the-napkin computation works, and anyone not interested 
in the sharp moment condition need read no further. In Section 3, we give examples to 
show why (C2) is needed with p — 1 > r and why it is important to have moment bounds 
for the negative part of the increments as well as the positive part. Proofs are given in 
Sections 4 and 5, with Section 4 containing a reduction to a result on martingales and 
Section 5 containing a proof of the martingale result. 



2. Main results. 



Throughout this paper, the filtration {Tn} refers to any filtration to which {X n } is adapted. 
We continue to use (CI) and (C2) for conditional expectations with respect to jF n , slightly 
generalizing the notation of the introductory section. 

Theorem 1 Let X n be random variables and suppose that there exist constants a > 0, J, 
V < oo, and p > 2, such that Xq < J, and for all n, 

E(X n+ i — X n | Tn) < —a on the event {X n > J} (CI) 

and 

V(\X n+1 -X n \P \X ,...,X n ) <V (C2) 
Then for any r G (0,p — 1) there is a c = c(p, a, V, J,r) > such that E(X+) r < c for all n. 

Applying this theorem to the process X' n := X n — {Xq — J) + in the case r = 1 immediately 
yields the corollary: 
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Corollary 2 Under hypotheses (CI) and ( C2) of Theorem 1, but without assuming Xq < J, 
we have 

E(X n | T ) < c(p, a, V, J, 1) + (X - J)+. 

Remark. By following through the proof (presented in Sections 4 and 5), we are able to 
provide an explicit formula for the quantity c(p, a, V, J, r) of Theorem 1. Indeed, for a = 1 
and J = 0, we have c(p, 1, V, 0, r) = K((p — r) where K = C(b,p, r) = TP^ciC '(p, b) + C4. 
Here b = 2P(B + (1+B)p); B = V(l+V); C'(p, b) = max(l, c'{p, b)); c'(p, b) = c^l+c- 1 ^; 
c 2 = c p b(4P + 4f-^); c 4 = C'(p, b)c 3 a P /2); c 3 = 3^6(^6 + ^ + 3 r ); and c p = (p - l)f 
is the constant from Burkholder's inequality. (Recall that C( w ) = i s the Riemann 

zeta function, finite for !fte(u>) > 1.) Then for general a and J, we have c(p,a,V, J,r) = 
J + a r c(p, l,V/a p ,0,r). Now, these formulae are clearly rather messy, and may be of limited 
practical use. However, it may still be helpful to have them available for ready reference. 

We also state an extension allowing the negative part of the increments to avoid the 
moment condition in (C2): 

Corollary 3 The conclusion of Theorem 1 still holds when X n+ \ — X n is replaced by 
(X n+ i — X n )lx n+1 ~x n >z n in conditions (CI) and (C2), and Z n < —a is any sequence 
adapted to {T n }. 

The proof of Theorem 1 proceeds by decomposing according to the last time U before 
time n that {X^} was less than J. When p > 4, Markov's inequality, together with a crude 
LP estimate on X n — Xjj, gives bounds on the tails of X n sufficient to yield Theorem 1. We 
finish the section by giving this argument. 

Assume the notation and hypotheses of Theorem 1. Fix a positive integer n. Let 
U = m&x{k < n; X^ < J}. Let = E (Xi + \ — | JFj), so that ^ < —a on {Xi > J}. We 
may recenter (see the proof of Corollary 5 for details) to obtain 

E((X n+1 -X n -fi n y \T n ) < V 
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for some V' < oo. 



But then, for t > J, we have 



n-l 



P(X„ > t) = £ P ( X n >t,U = k) 
k=0 

n-l 

< F ( x n -X k >t-J, X k < J, Xi > J for k < i < n) 

it — L / 

< Y P( (X n - X n ^ - Mn _!) + . . . + (X k+l -X k -fi k )>t-J- V Xlv + a(n-k- 1), 

1 — n \ 



n-l 



fc=0 



X k < J, Xi > J for A; < i < n 
[since fa < — a for k < i < n, and fi k < V l l p ] 

n—l , s 

<^E|ft-Vr^ 1 ) + ... + (%-x rW )| p (t-j-y 1 /p + a ( n -i-i)) 

fc=0 ^ ' 

[by Markov's inequality] 

n-l 



< J] c p y'(n - fc) p/2 (i - J - + o(n - k - 1)) 

fc=0 

[by Lemma 7, which is a direct application of Burkholder's inequality] 

oo 

< c p V '(Z + l ) P ' 2 (t-J- V 1/p + al) ~ P . 



It then follows that 



E(X n ) 



dtP(X n > t) 



p oo oo 

< (j + v 1/p + i) + / dty Cp v'(e + i) p/2 (t-j -v 1/p + a eY p 

JJ+VVp+I f^ Q V 1 

This integral-of-sum does not depend on n. Furthermore, for p > 4 it is straightforward to 
check (by integrating first) that it is finite. This gives the result. □ 



3. Some counterexamples. 



We here present a few counterexamples to show that the hypotheses of Theorem 1 (in 
particular, the restriction that p > 2) are really necessary. 

1. The following example is due to Madhu Sudan (personal communication via A. Borodin). 
Let {X n } be a time-inhomogeneous Markov chain such that P(^2 = 0) = 2/3 = 1 — 
P(X 2 = 2), and such that for n > 2, 

P(X n+1 = n + 1 1 X n = n) = 1 - 2/n 

P{X n+1 = 0\X n =n) =2/n 
P(X n+1 = n + 1 1 X n = 0) = l/n 
P(X n+1 =0\X n = 0) = l-l/n 
These transition probabilities were chosen to ensure that 

/ 0, P rob2/3 
I n, prob 1/3 

for all n>2. Hence, E(X„) = n/3, so that sup ra E(A / " n ) = oo. 

On the other hand, it is easily verified that (CI) is satisfied with a = 1 and J = 0. 
Furthermore, (C2) is satisfied with p = 1 and V = 3. We conclude that condition (CI) 
alone, or combined with (C2) with p = 1, does not guarantee stability. 

2. When (C2) holds with p = 2 it appears one has to do a little more to engineer a 
counterexample; specifically, we line up all the jumps out of (— oo, J) to amass at a 
fixed time M. Fix a large integer M, and define a time-inhomogeneous Markov chain 
by setting Xq = 0, and, for < n < M — 1, letting 

P(X n+1 = X n - 1 1 X n > 0) = 1 

P(X n+1 = I X n = 0) = 1 - (M - n)- 2 
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P(X n+1 = 2(M - n) | X n = 0) = (M - n)~ 2 

Then it is easily verified that (CI) is again satisfied with a = 1 and J = 0. Furthermore, 
(C2) is satisfied with p = 2 and V = 4. 

On the other hand, setting A = exp (— J2i^i V* 2 ) > 0' we compute that 

M 



E(X M ) = J2(M-k + l) P(X k > and Xj = for j < k) 

M 

> ^(M-fc + l)(^/(M-fc + l) 2 ) 



k=0 

M M+1 

= ^A/iM-k + l) = Yl A /j 

k=0 j=l 

which goes to infinity (like AlogM) as M —> oo. 

This shows that EX n cannot be bounded in terms of a, J and V, and by "string- 
ing together" such examples, for larger and larger choices of M, we can clearly make 
sup n E(X„) = oo. We conclude that condition (CI), combined with (C2) with p = 2, 
still does not guarantee stability of {X n }. 

3. ^,From the queueing theory perspective, it would be desirable, in condition (C2) of 
Theorem 1, to replace |X n +i — X n \ by [X n +i — i-e. to bound the p th moments of 

just the positive part of the increments. Intuitively, this would correspond to allowing 
arbitrarily large negative increments, and bounding only the large positive increments. 
The problem with this is that the process is not sufficiently affected by its negative drift 
when this is all concentrated into a few unlikely large jumps. We give a counterexample 
to demonstrate this. 

Fix < e < 1, and consider the following time- homogeneous Markov chain {X n }. Let 
Xq = 0, and for n > 0, let 

P(X n+1 = l\X n = 0) = 1 
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P(X n+1 = x + l\X n =x>0) = 1 - (1 + e)/(x + 1) 
P(X n+1 =0\X n = x>0) = (l + e)/(x + l) 

Then (CI) is satisfied with J = and a = e. Also, [X n+ \ — X n } + < 1, so (C2) would 
indeed hold (for any p > 0, and with F = 1) if we replaced \X n+ i — X n \ by [X n+ \ — X n \ + . 

On the other hand, it is straightforward to see that C(X n ) converges weakly to a sta- 
tionary distribution 7r(n), which is such that n(n) ~ Cn~ l ~ e as n — > oo. In particular, 
X)„n7r(n) = oo. It follows that E(X„) — > oo, i.e. that {X„} is not stable in this case. 
We conclude that Theorem 1 does not continue to hold if we consider only the positive 
part of X n+ \ — X n in condition (C2). 

Remark. This last counter-example only works when a < V l / p . In the case where X n > 
for all n, this appears to be an extremal counterexample, leading to the following open 
question: 

Does Theorem 1 continues to hold for sufficiently large a if we assume X n > 
and replace \X n+1 - X n \ by [X n+1 - X n ] + in (C2) ? 

Despite this counter-example, the hypotheses of Theorem 1 may indeed be weakened to 
allow some large negative increments. However, both condition (CI) and condition (C2) 
must be identically modified so that negative drift is still manifested. This is the motivation 
for having stated Corollary 3 as an extension to the main theorem. 

4. Reduction to a martingale question. 

We will derive Theorem 1 and Corollary 3 from the following martingale result. 
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Theorem 4 Let {M n : n = 0, 1, 2, . . .} be a sequence adapted to a filtration {J- n } and let 
A n denote M n+ i — M n . Suppose that the sequence started at M\ is a martingale (i.e., 
E(A ra | F n ) = for n > 1), and that Mq < 0. Suppose further that for some p > 2 and 
b > we have 

E(|A n n^„)<6 (1) 

for all n including n = 0. Let r = inf{n > : M n < n}. Then for any r € (0,p) there is a 
constant C = C(b,p,r) such that 

E ((M+) r l T>t ) < Cf-P. (2) 

We defer the proof of Theorem 4 until the following section. In the remainder of this 
section, we assume Theorem 4, and derive Theorem 1 and Corollary 3 as consequences. 

Corollary 5 Let {Y n } be adapted to {T n } with Yq < 0. Suppose E(| A' n \ p \ T r ^) < B for all n 
and E(A^ | T ri ) < for all 1 < n < a, where A' n = Y n+ \ — Y n and a = inf{n > : Y n < n}. 
Then for < r < p there is a constant K = K(B,p, r) such that 

B((Y t + ) r l a>t )<Kf-P. 

Proof: An easy fact useful here and later is that z + < 1 + \z\ p and hence 

B\Z\ P < b EZ+ < 1 + 6. (3) 

Recall (see e.g. Durrett 1996, p. 237) that the supermartingale {Y nAa : n > 1} may be 
decomposed as Y ntxa = M n — A n where {M n : n > 1} is a martingale and {A n : n > 1} 
is an increasing predictable process with A\ = 0. Let /u n denote E(A^|^> t ). Then the 
increments A n := M n+ \ — M n satisfy 

E(|A n r I Fn) = E(|A; - /x n n T n ) < 2PE(\A' n f + \^ n f \ T n ) < %>{B + (1 + Bf). 

Applying Theorem 4 to {M n } with b = 7P(B + (1 + Bf) and M := Y yields 

E ((M+yi T>t ) < Cf-P. (4) 
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When a > t it follows that M n > n + A n for 1 < n < t and hence that r > t. Also, when 
a > t, we know that M t = Y t + A t >Y t and therefore that 

Y+l a>t < M+l T>t . 

The conclusion of the corollary now follows from (4), with if = C(2 P (B + (1 + B) p ),p,r). 
□ 

The above argument uses no properties of the process A n , other than its being nonin- 
creasing and adapted. In particular, it need not be predictable. If the increments can 
be decomposed into the sum of two parts, one satisfying the hypotheses of the corollary and 
one nonincreasing and adapted, then the second of these can be absorbed into the process 
{A n } and the result will still hold. Without loss of generality, the second piece can be taken 
to be A^Ia' <z n for some adapted nonpositive {Z n }. In other words, the moment condition 
need not apply to the negative tail of the increment, as long as the mean is still nonpositive 
when the negative tail is excluded. We state this more precisely as the following corollary. 

Corollary 6 Let {Y n } be adapted to {T n } with Yq < 0. Let {Z n } be any adapted nonpositive 
sequence. Suppose E(| A' n \ p l/^i > z n \F n ) < B for all n and E(A^1a^>z„ \ Fn) < for all 
1 < n < a, where A' n = Y n+ \ — Y n and a = inf{n > : Y n < n}. Then for < r < p, 

E ((Y t + yi a>t ) < Kf-v. 

□ 

We now use these corollaries to derive Theorem 1 and Corollary 3. 

Proof of Theorem 1 from Corollary 5, and of Corollary 3 from Corollary 6: 
First assume that a = 1 and J = 0. Given {X n } as in the hypotheses of the theorem, fix 
an N > 1; we will compute an upper bound for E(A^) r that does not depend on N. Let 
U := maxjfc < N : Xk < 0} denote the last time up to N that X takes a nonpositive value. 
Decompose according to the value of U : 

N-l 

B(x+y = e ((x+y i u=k ) . 

k=0 



10 



To evaluate the summand, define for any k < N a process {Y^>} by Yr = (X k+n + 
n)lx k <o- In other words, if X k > the process {Y^} is constant at zero, and otherwise it 
is the process {^ n } shifted by k and compensated by adding 1 each time step. We apply 
Corollary 5 to the process {Y^}. Hypothesis (CI) of Theorem 1, together with the fact 
that X k+j > for < j < cr<», imply that E(A^ | T n ) < when 1 < n < crW . Also, 
E(|A;| P I T n ) < E(|l + X n+l -X n \ p \ T n ) <B := 2 P (1 + V). The conclusion is that 

v([(Y^%) + Yl a(k)>N _ k )<Kf-i> 

with K = K(V,p,r). But for each k, 

X%l U= k < y Af-fc 1 a('=)>Ar-fe 

and it follows that 

E((x+yi u=k )<K(N-ky-p. 

Now sum to get 

N-l 

E(X+) r <Y,K(N- kf-v < K((P ~ r). 

k=0 

This completes the case a = 1, J = 0. 

For the general case, let X' n = {X n — J) /a. This process is covered by the analysis 
of the a = 1, J = case above, with V/aP in place of V . We conclude that ~E(X' n ) r < 
c(p, 1, V/a 2 , 0, r), and hence that EX n < c(p, a, V, J, r) := J + a r c(p, 1, V/a p , 0, r). 

The proof of Corollary 3 from Corollary 6 is virtually identical. □ 

5. The proof of Theorem 4. 

We now concern ourselves with the proof of Theorem 4. We begin with two lemmas. 

Lemma 7 Let {M n } be a martingale with Mq = 0, and with increments bounded in L p : 

E(|M n -M n _i|P |^„_i) <L. 
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Then there is c p such that E|M n | p < c p Ln p l 2 . 



Proof: Burkholder's inequality (see Stout 1974, Theorem 3.3.6; Burkholder, 1966; Chow 
and Teicher, 1988, p. 396) tells us that for p > 1, there is a constant c p for which 

/ n \P/ 2 

B\M n \ p < c p B (J2( M k ~ M fc _!) 2 J . 

(In fact, for p > 2 we may take c p = (p — l) p , cf. Burkholder, 1988, Theorem 3.1.) For any 
Zi, . . . , Z n , Holder's inequality gives 

E|Zi + • • • + Z n \ p / 2 < n p l 2 max E|Z fc | p / 2 . 

l<fc<n 

Set Z k = (M k - M fc _x) 2 and observe that EZ£ /2 < E((E(Z fe | Tk-if' 2 ) < E(E|M fe - 
Mk-i\ p | Fk-i) < so the conclusion of the lemma follows. □ 

Lemma 8 Assume the notation and hypotheses of Theorem 4- For x > 0, let S x = infjfc : 
Mk > x} be the time to hit value x or greater. Then there is a C = C'(b,p) such that 

^>S x )<^j- 2 . 

Proof: Fix x > 1 and bound in two ways the quantity E \M tA s x \ p - First, since {M TA s xAn : 
n > 1} is a martingale, \x\ p is convex, and r A S x > 1 is a stopping time bounded above by 
x, we have 

E|M TA sJ p <E|M x |P. (5) 
Using Lemma 7 gives E|Mj; — Mi\ p < c p bx p / 2 , and since E|Mi| p < b, this yields 



E 



\M X \ P = \\M X \\ P < (\\M 1 \\ P +\\M X -M 1 \\ P ) P < {b 1 ^ + (c p bx p / 2 ) 1 / p y < c'(p,b)x p / 2 (6) 



with c'(p, b) := c p b(l + c p 1 ) p . On the other hand, on the event {r > S x } we have M tA s x 
Ms x > x, so that 

x p P(r > S x ) < E|M rAS J p , 
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and combining this with (5) and (6) gives 

P(t > S x ) < x~ p c'(p,b)x p/2 

which proves the result for x > 1. Finally, for x < 1 we use P(r > S* x ) < 1, so the lemma 
follows with C'(p, b) := max(l, c'(p, b)). □ 

PROOF of Theorem 4: Let T = mi{k > : A k > t/4} be the time of the first large jump. 
Since r > t implies S x < x for all x < t, we can write 

E ((M t + ) r l T>t ) = E ((M+fl G ) + E ((M t + ) r l H ) , (7) 

where G = {T > S t/2 < t < r} and H = {T < S t/2 < t < r}. 

To bound the first term, abbreviate S := S t /2 and begin by observing that M$ < 3i/4 
on G, since the level t/2 or higher has just been obtained and the increment was no more 
than t/4. Thus 

E ((M+y 1 G ) < P(r > S)E((M+) r l T>t | F s ). 
The first factor may be bounded via Lemma 8: 

ip i 2 a 

p (->*)<^- (8) 

The second factor is bounded using the formula 

/•OO 

E (Z r l z>u ) = u r P(Z >u)+ ry r - l V{Z > y) dy. (9) 

Ju 

By Lemma 7 conditionally on T s , E(|M t -M s \ p \ F s ) < c p b(t - S) p / 2 < c p bt p / 2 . Hence by 
Markov's inequality, P(M t - M s > y | F s ) < c p bt p / 2 /y p . Therefore, 

E((M+yi T>t |^) < E((M+r 

L M t >t 

ry r - 1 V{M t >y\F s )dy 

POO 

< t r P(M t -M s > t/4 \F S )+ rtf-^Mt -M s >y\F s ) dy 

Jt/A 

< c v b4 p t r - p l 2 + / ry r - x c hV t p l 2 y- p dy 

Jt/A 
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< C p b(4P + 4P-nf- P /2 

p — r 

< c 2 (b,p,r)f-v/ 2 , 

where C2(b,p,r) := c p b(4? + A p ~ r r/(p — r)). Combining with (8) gives 

E ((M+) r l G ) < 2 p l 2 c 2 Ct r ~ p . 



(10) 



We will bound the second term by decomposing according to the value of T. A prelimi- 
nary computation is to bound the quantity E((M t + ) r lT=k,M t >t I Fk)- Break this into three 
pieces: the part up to time k, the jump at time k, and the part from time k + 1 to time t. 
For any < r < p — 1, \x + y + z\ r < 3 r (|x| r + \y\ r + \z\ r ) (use convexity when r > 1 and 
sublinearity when r < 1). Hence 

{M+y < s r [{M+y + (A+y + «m, - M fc+1 )+r • 

The event {T = k} implies M k < 3i/4, and is also in the initital cr-field of the martingale 
{M n — Mfc +1 : n > k + 1}. Therefore, when we condition on T k , we get 

' /V\ r 

E((M t + yi T=k;Mt>t \F k ) < 3 r ^ T j P(A fc >t/4|^ fc ) 

+E((A+fl Afc > t/4 |^) 
+ E(|M t - M fc+1 j r l T=fc)Mt _ Mfc+1 >i/ 4 |.F fc ) 



The moment condition E(|Afc| p | T k ) < b implies that P(A^ > y) < by p , hence the first 
of these contributions is at most 

3 r (fr^r p . 

Using (9) again, we bound the second of the three contributions by 

3'(-fP(A fc > -\T k ) + y / ry r - 1 P(A k >y)dy 
4 4 Jt/A 

which is at most 



3 r b(-) r - p + 3 r — — (-) r ~ p . 
4 p-r 4 
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Lemma 7 implies E(|Mf — M k+ i\ r \ -7~fc+i) < c p bt r / 2 , while lx=k & 3~k+i and has conditional 
expectation at most b(t/4)~ p given Tk- Therefore the third contribution is bounded by 

3 r b(^)- p c p bt r / 2 . 

Summing the three contributions gives 

B((M+yi T=kMt>t I T k ) < C3 t r ~ p (11) 

where c 3 = 3 r 4Pb(c p b + ^ + 3 r ). 

Now we bound the second term, by decomposing according to the value of T. 



E 



[t/2j 

t -y± H J = J2 E ((M t + ) r l H lT=k) 

k=0 
[t/2j 

= ^E[E((M+) r l H l T=fc |^ fc ) 

k=0 

The event {r > A;} is in Tk and contains the event H D {T = k}, so we have 
E((M+) r l#l T=Jfc \F k ) < l T >fcE((M+) r l r=fc |^ fc ) 

and hence 

[t/2j 

E((M+) r l H ) < E P ( r > fc)E [E((M+) r )l T=fc l^ifc) 

fc=0 

Plugging in the upper bound (11) and using Lemma 8 gives 

[t/2j 

E((M t +) r l H ) < 2 C"fc- p / 2 c 3 t r ^ 

fc=0 

and summing yields a bound of 

E((M+) r l H ) < c 4 t r ~ p 



(12) 



(13) 



for the second term, where C4 := C'cs((p/2). By (7), the two bounds (10) and (13) together 
imply the conclusion of Theorem 4. □ 
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This completes the proof of Theorem 4, and hence also the proof of Theorem 1 and 
Corollary 3. 
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